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SIGNIFICANCE  AND  EXPLANATION 


?■ 

The  central  limit  theorem  is  a  fundamental  theorem  in  probability  and 
statistics.  It  states  that  the  probability  distribution  of  the  sum  of  a  large 
number  of  small  and  mutually  independent  random  numerical  observations 
approaches  a  normal  distribution  as  the  number  of  observations  increases.  The 
Lindeberg  condition  is  a  condition  for  which  the  central  limit  theorem  holds. 
It  has  been  proved  to  be  both  necessary  and  sufficient. 

A  PoincarS-type  inequality  is  an  inequality  which  relates  the  integral  of 
the  square  of  a  function  to  the  integral  of  the  square  of  its  derivative.  In 
this  report  we  give  a  new  proof  of  the  central  limit  theorem  by  using 
Poincare-type  inequalities  to  prove  both  the  necessity  and  sufficiency  of  the 
Lindeberg  condition. 
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THE  CENTRAL  LIMIT  THEOREM  AND  POINCARE-TYPE  INEQUALITIES 


Louis  H.  Y.  Chen 

Borovkov  and  Utev  (1984)  defined  the  functional 

o(x>  -  «p 

q  u'llq'l*)] 

for  any  random  variable  X  with  finite  variance  a  ,  where  the  supremum  is 
taken  over  the  class  of  absolutely  continuous  functions  g  such  that  0  < 
Var[g(X)]  <  ».  They  proved  that 

(1)  U( X)  >  1  and  if  U(X)  ■*  1  then  X  has  a  normal  distribution. 

Using  this  result  they  further  proved  that 

(2)  if  x1fx2r...  is  a  sequence  of  random  variables  such  that  U(Xn)  ♦  1 


then  the  moment  generating  function  of  (XR  -  EXn)/[Var(Xn)] exists 


and  converges  to  that  of  the  standard  normal  random  variable  in  a 
neighborhood  of  zero . 

It  is  natural  to  ask  if  ( 1 )  can  also  be  applied  to  prove  the  central 
limit  theorem  under  the  Lindeberg  condition.  This  question  was  in  fact  raised 
hy  S.  Kotani  in  private  communication  with  the  author  and  motivated  the 
present  work. 

The  existence  of  the  moment  generating  function  of  Xjj  in  (2)  is  due  to 


the  finiteness  of  U(Xn)  (see  Borovkov  and  Utev,  Theorem  2).  Since  the 


central  limit  theorem  does  not  require  such  a  strong  condition,  arguments 
different  from  those  of  Borovkov  and  Utev  would  have  to  be  used.  It  turns  out 
that  a  Poincar6-type  inequality  for  sums  of  independent  random  variables 
proved  along  the  same  line  as  in  Chen  (1985,  Section  2)  is  the  key  to  this 
problem.  Using  this  inequality  we  can  prove  not  only  the  sufficiency  of  the 
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Lindeberg  condition  but  also  its  necessity. 

In  order  to  facilitate  application,  we  restate  (1)  in  a  different  form. 
Let  Cg(R)  be  the  class  of  functions  g  such  that  g  and  g*  are  bounded 
and  continuous. 

PROPOSITION  1.  Let  X  be  a  random  variable  with  finite  variance 
a2  >  0.  If  Vartg(X)]  <  o2E[g'(X)]2  for  gc  C^(R),  then  X  has  a  normal 
distribution . 


To  see  that  (1)  and  Proposition  1  are  equivalent,  we  define 


U0(X) 


sup 


Var[g(X)] 


gccQm) 


a2E[g' (X) ] 2 


Var Ig(X) ] >0 


and 


UB(X) 


sup 


Var[q(X)] 


qtc'(K)  o2B[g’  tX>]  2 

B 


Var(g(x)]>0 

where  Cq(;r)  is  the  class  of  c“  functions  on  R  with  compact  support. 

Then  we  observe  that  by  definition,  U(X)  >  1  and  that  by  Theorem  2(i)  of 
Borovkov  and  Utev,  UQ(X)  *»  UB(X)  =  U(X). 

Since  we  are  interested  in  both  an  application  of  Poincare -type 
inequalities  and  a  new  proof  of  the  central  limit  theorem,  it  is  fitting  to 
examine  the  arguments  which  lead  to  (1)  and  hence  Proposition  1.  There  are 
three  different  proofs  of  (1).  Ail  begin  with  a  variational  argument.  After 
that  the  first  uses  the  method  of  moments  (see  Borovkov  and  Utev,  Theorem  3). 
The  second  uses  the  characteristic  function  (see  Chen  and  Lou  (1985,  Theorem 
2.1  and  Corollary  2.1).  The  third  uses  differential  equations  (see  Chen  and 


Lou,  Lemmas  4.1  and  4.2) 
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Lindeberg  condition  but  also  its  necessity. 

In  order  to  facilitate  application,  we  restate  (1)  in  a  different  form. 
Let  be  the  class  of  functions  g  such  that  g  and  g'  are  bounded 

and  continuous. 

PROPOSITION  1.  Let  X  be  a  random  variable  with  finite  variance 
a2  >  0.  If  Vartg(X)]  <  o2E[g'(X)]2  for  g  €  C B(R),  then  X  has  a  normal 
distribution. 


To  see  that  (1)  and  Proposition  1  are  equivalent,  we  define 


U0(X)  = 


sup 


Var [g(X) ] 


geC>)  a2Etg,(X,]2 


Var[g{X)] >0 


and 


vx) 


gup  v-£Mi)JL 

*c>)  °Etg,(xn 
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Var  [g(x)]  >0 


where  Cq(f)  is  the  class  of  c“  functions  on  R  with  compact  support. 


Then  we  observe  that  by  definition,  U(X)  >  1  and  that  by  Theorem  2(i)  of 
Borovkov  and  Utev,  Ug(X)  *  UB(X)  =  U(X). 

Since  we  are  interested  in  both  an  application  of  Poincar%-type 
inequalities  and  a  new  proof  of  the  central  limit  theorem,  it  is  fitting  to 
examine  the  arguments  which  lead  to  (1)  and  hence  Proposition  1.  There  are 
three  different  proofs  of  (1).  All  begin  with  a  variational  argument.  After 
that  the  first  uses  the  method  of  moments  (see  Borovkov  and  Utev,  Theorem  3). 
The  second  uses  the  characteristic  function  (see  Chen  and  Lou  (1985,  Theorem 
2.1  and  Corollary  2.1).  The  third  uses  differential  equations  (see  Chen  and 


Lou,  Lemmas  4.1  and  4.2) 


We  now  prove  the  sufficiency  of  the  Lindeberg  condition.  Let 


Xni . Xnr  ,  n  >  1,  be  a  triangular  array  of  row-wise  independent  random 

n 

2  2 

variables  with  zero  means  and  finite  variances  o  . , . . .  ,a  such  that 
_  _  nl  nr 

r  r  n 

n  n 

I  a  ■  1.  I^t  Wn  *  J  ^t\L  an(*  Wn  ^  ”  Wn  “  Xni*  By  'rheorem  2.1  of  Chen 
i=1  i-1 

( 1985), 

rn  W(i> 


Var (g( W  ) ]  <  )  e[ Var  n  g(W  )] 

i=1 


for  g  e  C1(1R)  such  that  |g(x)l  <  C( 1  +  |x()  for  some  constant  C.  Now  the 
riqht  hand  side  of  the  inequality  equals 
rn  w(i)  w(i) 

T  FE  n  [g(W  )  -  g(w(i))  -  E  n  (g(W  )  -  g(W(i)))]2 
n  n  n  n 


<  l  E[g(W  )  -  qw{i)))2 
i=1 


l  e[/  ni  gvw'11  +  t,dt]  2 
i=1 


<  J  E^Xni  /onl  + 


rn  r  c®  fi)  2  i 

=  l  e{x  J  [I(X  >  t  >  0)  -  I(X  <  t  <  0)][g’(w'  +  t)]  dt} 


=  V  r  E[g'(W(i>  +  t)] 2X(i)(t)dt 
i-1  '•*  n  n 


where  K^1,(t)  =  EX1[l(Xni  >  t  >  0)  -  KX^  <  t  <  0)]  >  0.  Define  the 


probability  measure  vn  on  BCEO  by 


,  r  rm  (i)  (i) 

f  fdv  =  >  f  Ff(w'  ,t)F'  ’ ( t ) dt 

R2  n  ^  n  n 

2  2 

for  bounded  and  continuous  functions  f  on  IR  .  Also  define  t!>  :  ir  ♦  IR 


by  iHx, y)  =  x+y.  Then,  combining  the  above  inequalities,  we  have  the 


^  T  J*  TV  t  «  ' 


following  Poincare-type  inequality  for  Wn  :  For  g  e  C1®)  such  that 
|g(x)|  <  C( 1  +  jx|)  for  some  constant  C, 

(3)  Var[g(Wn)]  <  (g’  )2dvn«\)»~1  . 

Let  v^^A)  -  vn(AxiR)  and  v£2)(A)  -  vn([RXA)  for  AeBOR).  Since 


r 


(  2  \ 

the  Lindeberg  condition  implies  that  '  **«*>  eQ,  the  Dirac  measure  at  0, 
as  n  -►  «.  Now  EW^1-*2  <  EW2  ”  1  for  each  n  anti  *  implies  that  {v^1^} 
is  tight  and  hence  relatively  compact.  Let  {v^JH  be  a  weakly  convergent 
subsequence  and  let  v^, '  =■=>  L(Z).  Then  vn,  — >  L(Q)  and  hence 
»">  L( Z).  It  follows  from  (3)  that  for  g  e  Cg(iR), 

(4)  Var [g(Z) ]  <  E[g*(Z)]2  . 


If  we  show  that  Z  has  a  standard  normal  distribution,  then 

L(Wn)  —*>  N( 0, 1 )  is  proved.  By  Proposition  1  it  remains  to  prove  that 

2 

Var(Z)  »  1.  First  we  observe  that  by  virtue  of  EW  =  1,  Z  is  square 

n 

integrable.  Let  ea  c  C^(IR)  be  increasing  such  that 

x  if  1 x |  <  a 


»a<x) 


a+1  if  x  >  a+2 
-a- 1  if  x  <  -a -2 


By  (3), 


Since 


-1 


VarlV  -  W>J  <  4R(1^i)2dVn-°'»'"1  * 

converges  weakly,  {vni°<J'~1}  ts  tight.  Therefore  for  e  >  0 
Var[Wn,  -  ?a(Wn,)]  <  4n^A)2dvn'°’,'~1  <  e 


V 


-4 


for  sufficiently  large  a.  Now  (Var) 


is  a  semi norm  and  so 


1/2 

ll  -  [Var(*a(Wn,))]1/2  |  .  |  [Var(Wn,)]1/2  -  [Var(*a(Wn, )  )]1/2  | 

<  [Var (W  -  *  (W  ,)]'2 

n  ’  an 

<  eV2  . 

By  letting  n"  +  •  and  then  a  +  »,  we  obtain 

|l  -  [Var ( Z ) ]^2  |  <  e^2  . 

This  implies  that  Var(Z)  ■  1  and  hence  L(Wn)  ■— >  N(0,1). 

We  now  prove  the  necessity  of  the  Lindeberg  condition.  First  we  need  two 
simple  propositions. 


PROPOSITION  2. 


The  Lindeberg  condition  holds  if  and  only  if 


(2) 


•>  e 


o* 


PROOF.  The  "only  if"  part  has  been  proved  above.  The  "if"  part  follows 
from  the  following  inequalities: 


r  r 

',X„ll,IXnll  -e,I'IXnll  >  *' 


*  2  EXnl,(IXnll  >  “>  • 


For  the  next  proposition  let  Cy(lR)  be  the  class  of  functions  g  on  IR 
such  that  g,  g’  and  g”  are  uniformly  continuous,  !g(x)|  <  C( 1  +  |x|) 
for  some  constant  C  and  g’  and  g’ ’  are  bounded. 


PROPOSITION  3.  Let  Z  and  T  be  independent  random  variables  such 

2 

that  Z  has  the  normal  distribution  with  mean  0  and  variance  o  >0.  If 
Var[g{Z)]  <  <J2E[g'  (Z+T)  ]  2  for  g  €  OR),  then  T  ■  0  w.p.  1. 


PROOF.  By  the  variational  argument  used  in  Borovkov  and  Utev  (1984)  or 


Chen  and  Lou  (1985),  we  have 

EZh(Z)  -  o2Eh*(Z+T) 

for  h  €  Cy(E) .  This  equation  also  holds  for  h(x)  *  x  by  approximating 

o 

this  function  by  functions  of  Cy(F).  So 

304  =  EZ4  *  302E(Z+T)2  . 

The  finiteness  of  E(Z+T)2  and  EZ2  implies  that  of  ET2.  By  expanding 
E( Z+T) 2  we  obtain  o2  =  02  +  ET2  which  implies  that  ET2  =  0  and  so  T  =  0 
w.p.  1. 

For  the  proof  of  the  necessity  of  the  Lindeberg  condition  we  need  the 

usual  assumption  that  for  every  e  >  0,  max  P(|x  |>e)  +  0  as  n-*-“. 

1<i<r 

n 

Let  g  €  Cy(R) •  For  every  e  >  0,  let  6  >  0  be  such  that  |(g'(x))2  - 

( g* ( y ) ) 2 1  <  e  for  lx-y|  <  6.  Then  we  have 
r 

n 

'  \  /"  E{(g'(w(i)  +  t))2  -  (g’(w  +  t))2}<l)(t)dtl 

r  rn 

(5)  <  cf  V  P(  | X  .1  >  6)  f°°  K(l)(t)dt  +  e  y  f”  KU)(t)dt) 

i^1  ni  ^  n  i-1  '  -  n 


<  cf  max  P( lx  . !  >  5)  +  el 
. .  •  >  n  x 

1<i<r 

n 

for  some  constant  C.  Define  the  probability  measure  on  8(B)  by 

r 

n  , . . 

r  fdv  =  y  /  mix'  '(t)dt 

1  n  l.  j  n 

i=1 

for  bounded  and  continuous  functions  f  on  K.  In  view  of  (5),  the 
inequality  (3)  can  be  written  as 

(6)  Va r [ g ( W  )]  <  J°*  E(g’(W  +t )  1  2  v  (dt)  +  o(1)  . 

n  -*ao  n  n 

Suppose  L(Wn)  ■■>  N(  0, 1 ) .  Let  (vn,l  be  a  subsequence  of  (v^  which 

rw  2 

converges  vaguely  to  a  subprobability  measure  v.  Then  for  g  e  Cq(IO,  the 


2 

class  of  C 
show  that 


functions  on 


IR  with  compact  support,  it  is  not  difficult  to 


E[g*(Wn  +  t)]2  vn,(dt)  +  fl*,  E[g'  (Z+T)  ]  2  v  (dt) 
where  Z  is  a  standard  normal  random  variable.  So  by  (6)  we  have 
(7)  Var [g(Z) ]  <  /“^  E(g' (Z+T)] 2  v(dt)  . 

By  approximating  Cy(IR)  by  functions  of  C2(3R),  (7)  holds  for  g  e  C2(IR) . 

By  letting  g(x)  =  x,  we  get 


/oo 

_  V(dt) 

■  CO 

and  so  v  is  a  probability  measure.  By  Proposition  3,  v  must  be  eQ. 

~  ~  { 2 ) 

Hence  v  — >  e  .  But  v  =  v  .By  Proposition  2,  the  Lindeberg  condition 
n  0  n  n 


holds.  This  completes  the  proof 
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